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ABSTRACT 

In this paper we examine the percentage of type 1 AGN which require the inclusion of 
a soft excess component and/or significant cold absorption in the modelling of their 
X-ray spectra obtained by XMM-Newton. We do this by simulating spectra which 
mimic typical spectral shapes in order to find the maximum detectability expected at 
differe nt count leve l s. We then apply a correction to the observed percentages found 
for the lScott et al.l (|201l[) sample of 761 sources. We estimate the true percentage of 
AGN with a soft excess component to be 75 ± 23%, suggesting that soft excesses are 
ubiquitous in the X-ray spectra of type 1 AGN. By carrying out joint fits on groups 
of low count spectra in narrow z bins in which additional spectral components were 
not originally detected, we show that the soft excess feature is recovered with a mean 
temperature kT and blackbody to power-law normalisation ratio consistent with those 
of components detected in individual high count spectra. Cold absorption with Nn 
values broadly consistent with those reported in individual spectra are also recovered. 
We suggest such intrinsic cold absorption is found in a minimum of ~ 5% of type 1 
AGN and may be present in up to ~ 10%. 
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1 INTRODUCTION 

The X-ray spectral properties of AGN classified optically 
as type 1 have b een recently exten s ively studied (e.g . 
Young et all 120091 , iMateos et ail |2010l ICorral et all l201ll . 



Scott et al .Il201ll b The underlying spectrum in the 0.5-12.0 
keV band consists of a power law of photon index r ~ 2 
thought to be produced by the inverse Compton scatter- 
ing of low e nergy disc photons by a corona of relativis- 
tic electrons l|Haardt fc M araschi 1993). In higher quality 
spectra, a soft excess component is detected , rising above 
the p ower law at rest-frame energies <1 keV l|Arnaud et al.l 
1985). This was originally interpreted as the hard tail of 
the 'Big Blue Bump', accretion disc emission seen in the 
UV, but is now thought perhaps to b e an artifact of ionised 
absorption dGierliriski fc Pond l2004h or i onised reflection 
l|Ross fc Fabianll2005l : ICrummv et alj|2006h . Lower energies 
are also affected by photoelectric absorpti on, although the 
standard orientation based Unified Model l|Antonuccilll993D 
does not predict any intrinsic X-ray absorption to be present 
in objects which have been classified as type 1 due to the 
presence of broad lines in their UV/optical spectra. 

There have been many studies in which some type 
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1 objects have shown evidence for intrinsic X-ray absorp- 
tion. The typical percentage of such objects is ~ 10% 
with many studies quoting this similar valu e dPage et al 



2003; Perola et al. 2004; Piconcelli et al. 2005; Mateos et al 
2005b): iMainieri et al.ll2007l; iGarcet et alj|2007t lYoung et al 
20091 : (Mateos et al.ll2010l : ICorral et al.ll201ll ). 



There are a large range of values quoted for the percent- 
age of type 1 sources which include a soft excess. The ear- 
liest study with EXOS AT suggested 30-50% o f objects in- 
cluded the component dTurner fc Pound s 1989) and a study 
with ASCA found ~ 40% (|Reeves fc Turner! l200Ch . It has 
also been suggested that the soft excess ma y be a ubiqui- 
tous f eature in optically sele cted PG quasars l|Porquet et al.l 
120041 : iPiconcelli et al.l [2005h . however these samples are bi- 
ased towards bright and low redshift sources. The quoted 
percentage of soft excesses can be very different depend- 
ing upon the redshi ft range being considered. For example, 
IMateos et all (|2010h find a percentage of only 8% when con- 
sidering their entire sample, but this is increased to 36% 
when only sources at z < 0.5 are considered. Similarly the 
CAIXA sample of XM M-Newton target sou rces finds a high 
percentage of ~ 80% jBianchi et all 120091 '). This could be 
because the sample is biased towards low redshift objects 
and/or good quality spectra in which detecting the spectral 
component is easier. Clearly, in order to determine whether 
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the soft excess is present in all sources, the influences of 
redshift and spectral quality need to be taken into account, 
using a sample which covers a large range in these proper- 
ties. 

In a rece nt study of t h e X-ra y spectral properties of 761 
type 1 AGN. IScott et al.1 (|201ll : hereafter Sll), find ~ 8% 
of their sample require a soft excess component and ~ 4% 
require intrinsic cold absorption in the modelling of their X- 
ray spectra. It was noted that these values represent lower 
limits on the intrinsic percentage of sources which include 
such components, since their detectability is limited by the 
quality of the spectra. In this paper we follow on from this 
analysis and aim to deduce how common these spectral fea- 
tures really are. In the case of the soft excess we do this by 
simulating typical spectra at different count levels in order 
to determine the maximum detection rate expected for such 
an additional spectral feature. This can then be compared 
to the observed results in order to determine the intrinsic 
percentage. The original data sample is described in §3] The 
soft excess simulations and a joint fitting of multiple low 
count spectra in an attempt to recover the soft excess are 
described in §31 Section [5] considers the presence of absorp- 
tion components. We discuss our results in §5]and summarise 
our conclusions in 



2 DATA 

The Sll sample was created b y cross-correlati n g the 
SDSS DR5 quasar catalogue i Schneider et al.1 120071 ) 
and the serendipito us X-ray source catalogue, 2XMMi 
i|Watson et all l2009h . The X -ray spectra were extracted 
using standard SA£|j tasks and fit using xspec vll.3.2 
l|Arnaudl [l996) over the energy range 0.5-12.0 keV. His- 
tograms showing the redshift and net (i.e. background sub- 
tracted) counts distributions of the sources can be found 
in Fig. [1] All sources have >75 combined MOS+pn counts, 
allowing both a simple power law and an absorbed power 
law to be fit, with the power-law slope, T, allowed to vary 
freely. For 680 sources with >100 counts, models including a 
blackbody component were also considered in order to model 
any soft excess. The best-fitting model was assumed to be 
a simple power law unless the F-test, used at the 99% sig- 
nificance level, determined that additional components were 
statistically required. A summary of the different models 
considered, can be found in Table [1] along with the number 
of sources best-fit with each. Soft excesses are found in ~ 8% 
of the sources in the sample and ~ 4% require intrinsic ab- 
sorption. An additional Galactic absorption component was 
included in each m odel, fixed at the Nh valu e determined 
from the Hi map of iDickev fc Lockmanl (|l990l ). 

Fig. [2] shows the detected percentages of the soft excess 
(thick, blue) and absorption (red) components as a func- 
tion of the number of counts in the spectra. The detected 
percentage of the additional components is much lower in 
spectra with low counts where the statistics are poorer and 
the features are not detected with enough significance. It 
was suggested in Sll that since at the highest count levels 

1 The description and documentation are available online at 
://xmm.esac.esa.int/sas/| 




Figure 1. Histograms showing the net counts and redshift dis- 
tributions of the sample used here, and described further in 
IScott et al.l 11201111 . 



Table 1. The different models used to fit the X-ray spectra 
of the sources in the Sll sample. 



Model 


Spectral Components 


Sources 


po 


Power Law 


672 


apo 


PL + absorption 


29 


po+bb 


PL + soft excess 


55 


apo+bb 


PL + absorption + soft excess 


5 


761 



we might expect to be able to detect all soft excess com- 
ponents if they are present, the intrinsic percentage could 
be as high as the ~ 80% found in the highest count bin, 
making soft excesses common in the X-ray spectra of type 

1 AGN. Fig. [2] also shows intrinsic absorption detected in 
up to ~ 25% of sources in the higher count bins. Since the 
sample is drawn from a population of type 1 AGN, such a 
component is not expected to be required in the modelling of 
their X-ray spectra. The F-test was used at 99% significance 
when choosing the best-fitting model for a particular source, 
therefore 1% of the detections of a specific spectral compo- 
nent can be considered spurious. This 1% level is shown by 
the dashed line in Fig. [2] 

The sample contains 62 sources which were the target of 
an XMM-Newton observation and therefore generally con- 
tain more counts in their X-ray spectra than the serendip- 
itously detected sources. The percentage of targets in each 
count bin is shown in Fig.[2]and increases towards the higher 
count bins as expected, with the top bin including almost 
only target sources. These sources could bias the detection 
percentages if they were selected for observation due to a 
previously known soft excess or intrinsic absorption. We 
therefore exclude the target sources from our subsequent 
analysis, leaving 699 sources in the sampk{3- 

For sources at increasing redshifts, the contribution of a 
soft excess component or absorption in the spectra will grad- 
ually decrease as a larger contribution is redshifted outside 
of th e XMM-Newton EP IC instrument bandpass (I Turner! 
l200ll ; IStriider et aI1l200lh . Therefore the detected percent- 
ages of these components are expected to be higher in bins 
containing mostly low redshift sources, which tend to be the 
bins with higher numbers of counts, hence the higher de- 
tection rate may be due to this redshift bias. The average 
redshift of the sources in each count bin is plotted in Fig. [2] 

2 699 sources have > 75 counts and 619 have > 100 counts which 
we fit with models including a soft excess. 
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Figure 2. The top plot shows how the percentage of sources 
which require an absorption or soft excess component varies de- 
pending on the number of counts (MOS+pn) that are available in 
the spectra. The lowest count bin includes sources with between 
75—320 total counts for absorption and 100-320 total counts for 
the soft excess, since these were the minimum numbers of counts 
required for fitting that particular spectral component. The sam- 
ple includes 62 sources which were the target of an XMM-Newton 
observation, rather than a serendipitous detection. The middle 
plot shows the percentage of sources within each count bin which 
are target sources. The errors in the top two plots have been cal- 
culated from Poissonian statistics. The bottom plot shows how 
the average redshift of the sources in each count bin varies and 
includes standard errors on the mean. 



and does decrease with increasing counts as expected. How- 
ever we note that within each count bin the sources do cover 
a large range in redshifts. 

To further investigate the redshift issue, the sources are 
split into broad redshift bins of z < 1 and z > 1. Separate 
detection curves are created and are shown in Fig. [3] In the 
case of the absorbed sources (top), the detected percentages 
for the low and high redshift sources appear to be consistent 
within the error ranges. This is likely due to the large range 
of JVh values found in the sample, which means that we are 
able to detect absorption in sources at a range of redshifts. 
The percentage of sources detected with a soft excess com- 
ponent is higher in the low redshift sample than in the high 
redshift sample as expected. The curve for the z > 1 sources 
shows gaps where the detected fraction falls to zero, due to 
the low numbers of sources; although soft excesses are de- 
tected in the sample up to z = 2, the majority (82%) are 
found in sources with z < 1 as expected. 



Absorption, z<1.0 
Absorption, z> 1 .0 



1' 



Soft Excess, z<1 ,0 
Soft Excess, z>1 ,0 



10 4 



Figure 3. These figures show how the detected fraction of ab- 
sorption (top) and soft excess (bottom) components varies with 
the numbers of counts in the spectra, considered in broad redshift 
bins of z < 1 and z > 1. There is a notable difference between 
the two curves in the bottom panel indicating a strong redshift 
dependence on the detectability of soft excess components. 



3 SOFT EXCESS COMPONENTS 
3.1 Intrinsic Percentage 

In order to determine the intrinsic percentage of sources 
with a soft excess component, we carry out multiple sets 
of simulations in order to quantify their detectability. This 
is done by finding the maximum percentage of components 
that are expected to be detected in spectra with different 
numbers of counts. 

At each of 5 redshifts and between 7 and 11 different 
count levels, we simulate 1000 spectra which include a soft 
excess. Each spectrum is fit over the energy range 0.5-12.0 
keV with the 'po' and 'po+bb' models, and the F-test is 
used at 99% significance to determine whether the compo- 
nent is statistically required. The percentage of sources in 
which we significantly detect the component is then deter- 
mined. By repeating this procedure with sets of spectra at 
different count levels and redshifts we construct synthetic 
detectability curves from which we can determine the max- 
imum detection percentage at any count level. 

We create our spectra, using the fake it command in 
XSPEC which distributes a given number of counts, con- 
trolled by varying the exposure time, around a defined model 
with statistical fluctuations and assigns them Poissonian er- 
rors. We define the model such that it mimics the shape 
of a typical source in the Sll sample, both in terms of 
the shape of the components i.e. the F and kT values and 
the size of the components, particularly the ratio of the 
blackbody normalisation to that of the power-law normali- 
sation since this will also determine how easy the blackbody 
is to detect over the power-law continuum. 55 sources re- 
quired the 'po+bb' model in the original Sll analysis. The 
distribution of power-law slopes and kT values for these 



4 A. E. Scott, G. C. Stewart & S. Mateos 




Counts 



Figure 4. The solid black line with error bars shows the observed 
percentage of soft excess components as a function of counts for 
the Sll sample after the target sources have been removed, which 
required a rebinning of the sources. The coloured curves show the 
detectability level for a soft excess component with a typical shape 
and size. Sets of simulations are carried out at different redshifts 
and count levels as shown by the coloured error bars. The curves 
are created from a quadratic fit to at least 7 such results (a linear 
fit is used for z = 1.3). 



sources were each fit with a Gaussian, and the best-fitting 
mean values with dispersions were F = 1.79 ± 0.46 and 
kT = 0.17 ± 0.08 keV. We therefore simulate sources with 
r = 1.8, kT = 0.2 keV and fix the normalisation ratio to 
the median value of 0.04. These values are intended to rep- 
resent a 'typical' source rather than the full range of values, 
although the distributions of kT and normalisation ratio are 
reasonably narrow. As shown in fig. 22 of Sll, there is a very 
tight correlation between the luminosities of the blackbody 
and power-law components. The median normalisation ratio 
of 0.04 used here corresponds to a luminosity ratio of ~ 0.2. 

As has been previously discussed, the detectability of 
soft excess components is strongly dependent upon the red- 
shift of the source. Therefore we run sets of simulations at 
a range of redshifts (z — 0.4, z = 0.75, z = TO, z — 1.3 and 
z — 1.5) and create five separate detectability curves shown 
by the different lines in Fig. [4] For interpolation purposes 
they are fit with a quadratic function which describes four 
of the curves well. The curve for z — 1.3 has a slightly differ- 
ent curvature from the asymptotic behaviour expected and 
therefore in this case we use a linear function to fit the sim- 
ulated points. The curves indicate the detection percentage 
that we would expect to see for a particular count level if all 
sources contained this component. If the observed percent- 
ages were consistent with this line, it would indicate that 
the spectral features are present in all of the sources in our 
sample and it is merely the quality of the spectra that lim- 
its our ability to detect them. For sources at z < 1 with 
> 10000 counts, the detectability curves lie at 100%. This 
suggests that the intrinsic percentage could be as high as 
~ 70%; the percentage found in the highest count bin (after 
removal of the target sources). We can determine the true 
soft excess detection percentage (corrected percentage) for 
each count bin in our data by dividing the observed detec- 
tion percentage by the maximum percentage obtained from 



our simulated curves which is fixed to lie between the spu- 
rious level, 1%, and 100%. 

As each bin includes sources at a range of redshifts, 
we calculate a corrected percentage using all five of the de- 
tectability curves and determine a weighted average value 
according to the equation; 

Result = (weight x corrected percentage) (1) 

where the 'weight' is the fraction of sources for which each 
particular redshift curve is appropriate for within that count 
bin. For sources at z > 2, the maximum percentage is fixed 
at 1%, since no real soft excesses are expected, but spurious 
detections may occur. Since the count bins are broad and the 
sources are not distributed evenly within them, they are fur- 
ther divided into sub-bins. The redshift-corrected percent- 
age is calculated for each as outlined above, and an overall 
corrected percentage is reconstructed for the full count bin 
using equation [T] where 'weight' in this case is the fraction 
of sources within the particular count sub-bin. 

The dashed blue line in Fig. [S] shows the corrected per- 
centagefl The values are roughly constant after the effect 
of spectral quality has been removed. In order to determine 
a value for the intrinsic percentage, x 2 is calculated for con- 
stant percentages between and 100%. The minimum \ 2 
occurs for a constant percentage of 75 ± 23%, for which the 
fit has a null hypothesis probability of p — 91%. This method 
produces a corrected percentage greater than 100% in the 
3rd bin. Whilst this has no physical meaning, its large error 
bar makes it entirely consistent with the corrected percent- 
ages obtained for the other bins. Capping this bin at 100% 
reduces our intrinsic percentage estimate by only 1%. 

To test whether a 1% spurious level is appropriate we 
simulated spectra which did not include a soft excess, and 
determined in how many we incorrectly detected this com- 
ponent with > 99% significance. For the majority of count 
levels this percentage was consistent with, or lower than 1%. 
However, for the lowest and highest count levels (< 100 and 
~ 50,000) the spurious level was somewhat higher (up to 
~ 5%), suggesting that both the usual statistical problems 
affecting simulated fits and some small systematic calibra- 
tion errors exist. The count bin most affected by the spu- 
rious level is the lowest, as the simulated curve for z — 1.5 
sources falls below 1%. If the maximum percentage from the 
simulated curve is fixed at 1%, the spurious level, the correc- 
tion factor is under-estimated. Excluding this bin from the 
intrinsic percentage determination gives 7ll24%> which is 
consistent with the value obtained when the bin is included. 



3.2 Joint Spectral Fitting 

We have shown that after correcting for spectral quality, a 
soft excess component may be ubiquitous in the Sll sam- 
ple. We now re-analyse spectra with low numbers of counts 



3 We note that percentages > 100% are a possible consequence of 
using this correction factor method. The count bins have been re- 
defined in order to reduce this statistical effect. However, should 
the intrinsic percentage be exactly 100%, we would expect some 
bins to show correction percentages lower, and others greater than 
100%. 
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Figure 5. The blue solid line shows the observed percentage 
of soft excess components in the Sll sample (after the target 
sources have been removed). The dashed line shows the detected 
percentage determined after the correction for spectral quality. 
The grey bar indicates the location of the intrinsic percentage of 
sources with soft excesses, where the width of the bar represents 
the la error boundary. 

in a joint fitting to see if the soft excess feature can be re- 
covered in a combination of spectra where it was previously 
undetected. 

The Sll sample includes 436 sources which were origi- 
nally fit with a simple power-law model i.e. the soft excess 
feature was not detected in their spectra with > 99% signif- 
icance, and which have < 500 counts. This limit is imposed 
so that the joint fitting is not dominated by a single object 
with high numbers of counts. Samples of ~ 30 sources (~ 50 
MOS and/or pn spectra) are created which cover a narrow 
range in redshift. The resulting groups of spectra include a 
total of ~ 7000 counts, a level at which a soft excess in a 
z < 1 source is expected to be detected 100% of the time if 
it is present. This is shown in Fig. |4]where the detectability 
curves for z = 0.4 and z = 0.75 sources are at 100% at a 
7000 count level. 

The groups of spectra are jointly fit with the simple 
power law model, 'po', in which Y is free to vary. In each 
case we find a best-fitting result of T ~ 2. In the case of 
the lowest two redshift bins the fit is significantly improved 
by using the 'po+bb' model, both in terms of a lower X* /v, 
as listed in Table [2j and an F-test comparing the two mod- 
els which equals 100% in both cases. However the F values 
are lower than 1.8, the value used in the simulations of the 
previous section. At lower redshifts, we expect to be more 
sensitive to cold absorption (see Sj4} and we note that if a 
fixed absorption component of Nu = 10 21 cm -2 is included 
in the fit, steeper values of F ~ 1.70±0.05 are obtained. This 
does not fully correct for the effect as the resulting contin- 
uum from stacking spe ctra with different le vels of absorption 
is known to be flatter (|Mateos et al.ll2010l ). 

We also fit each group of spectra with the 'po+bb' 
model in which V is fixed at 1.8 and kT is tied to the 
same value for each of the spectra. We determine a single, 
best-fitting normalisation ratio common to all the spectra by 
fixing the power-law normalisations at 10 -5 and tying the 
blackbody normalisations for each spectrum to a common 
value. A freely varying constant is added to the model to 



allow each data set to vary independently. The results can 
be found in Table [2] listed as the model 'po+bb fix A'. For 
all except the 1st bin we find average normalisation ratios 
which are consistent with the value of 0.04 used in the sim- 
ulations of the previous section. For all except the 5th bin, 
the kT values are consistent with those found in individual 
sources with high count spectra. 

Each group of spectra is also fit with the 'po+bb' model 
in which V is fixed at 1.8, kT is tied to a common value for 
each of the spectra and the normalisations of both the black- 
body and power-law components are left free to vary. These 
results are listed in Table f2] as the model 'po+bb fix B', 
and include the median value of the blackbody to power- 
law normalisation ratios and the percentage of spectra for 
which the ratio is > 0.01, which is interpreted as a black- 
body component being present. For the three lower redshift 
bins (z < 1) the inclusion of a blackbody component in the 
spectral fit results in a better fit and the blackbody temper- 
atures are consistent with the previous values obtained from 
the individual high count spectra. The normalisation ratios 
of the blackbody to power-law components are also consis- 
tent, both in terms of the median value and a Kolmogorov- 
Smirnov (KS) test which finds the full distribution of ratios 
to be not significantly different from those found in single 
object spectra in Sll. The median normalisation ratio of 
~ 0.04 is consistent with the value used in the simulations 
of the previous section and corresponds to a luminosity ra- 
tio of ~ 0.2. In addition, the percentage of spectra which 
do include a blackbody component (since the fit allows a 
blackbody normalisation of zero for individual spectra) is 
consistent with the intrinsic percentage calculated in H3.1\ 
This model also gives a better fit for the highest of the red- 
shift bins. However, the kT value is too high to be consistent 
with the temperatures observed in individual spectra. 

It has been suggested that the soft excess feature is 
ubiquitous in high accretion rate AGN and that it is this 
parameter which may determine its presence in the spec- 
tra, or the size of the component (e.g. lDone et alj|2012l ). We 
use the Eddington ratio as a proxy for mass accretion rate 
defined as AEdd = log(Z/boi/-£'Edd)- The bolometric luminos- 
ity was estimated from the observed X-ray luminosity in 
the 2-10 keV band, measured from the best-fitting spectral 
model, by app l ying t he luminosity-dependent correction of 
iMarconi et alj (2004). The Eddington luminosity was esti- 
mated u sing virially deter mined black hole mass estimates 
from thelh en et al.l (|2008l ) catalogue. As before, we use only 
sources with < 500 total counts and those best-fit with the 
model 'po'. In addition, we also further restrict the sources 
to those at z < 1 and we split the remaining sources into 
3 sub-samples containing roughly equal numbers of sources. 
The samples are fit with both the 'po+bb fix A' and 'po+bb 
fix B' models, the results of which are listed in Table[3]along 
with the properties of each sample. The 'po+bb fix A' model 
finds that a best-fitting normalisation ratio is consistent with 
the 0.04 found in single spectra for the top 2 bins. In the 
case of the lowest AEdd bin this value is considerably lower, 
although it has a large associated error. A KS test finds the 
distributions of the normalisation ratios determined from 
the 'po+bb fix B' model to be consistent, however the me- 
dian values do vary from a lower value than expected in the 
low AEdd bin to a higher value than expected in the high 
AEdd bin. In addition, the percentage of spectra for which 
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z range 



No. of sources 
(spectra)" 



Total 
Counts 



Model 



X 2 /" 



kT 

(keV) 



Normalisation Percentage 
Ratio with SE 



0.21-0.55 



0.56-0.77 



0.77-0.98 



0.98-1.12 



1.12-1.19 



1.19-1.28 



34 (51) 



30 (50) 



33 (51) 



31 (51) 



30 (50) 



29 (50) 



8078 



7661 



7367 



7583 



6749 



5614 



po 

po+bb 
po+bb fix A 
po+bb fix B 

po 

po+bb 
po+bb fix A 
po+bb fix B 

po 

po+bb 
po+bb fix A 
po+bb fix B 

po 

po+bb 
po+bb fix A 
po+bb fix B 

po 

po+bb 
po+bb fix A 
po+bb fix B 

po 

po+bb 
po+bb fix A 
po+bb fix B 



859/636 
664/584 
873/635 
675/585 

861/615 
682/564 
854/614 
693/565 

683/611 
624/559 
681/610 
579/560 

632/629 
576/577 
616/628 
629/578 

604/578 
571/527 
597/577 
606/528 

485/509 
450/458 
471/508 
416/459 



1.351) 
1.137) 
1.375) 
1.154) 

1.400) 
1.209) 
1.391) 
1.227) 

1.118) 
1.116) 
1.116) 
1.034) 

1.005) 
0.998) 
0.981) 
1.088) 

1.045) 
1.083) 
1.035) 
1.148) 

0.953) 
0.983) 
0.927) 
0.906) 



2 10+ 004 

i ro+0.08 
L -°°-0.06 

1.8 (fixed) 
1.8 (fixed) 

r. n7 +0.05 

z ' u ' -0.05 
1 62+ 08 

1.8 (fixed) 

1.8 (fixed) 



3+0.05 
-0.04 
i+0.04 
-0.05 



2.03Z 

1.98j 
1.8 (fixed) 
1.8 (fixed) 

9 n9 +0.05 
z ' u -0.04 

1 07+0.04 

x - y ' -0.06 

1.8 (fixed) 
1.8 (fixed) 

2 03+ ' 04 
z - u 'J-0 .05 

1 QS+ - 04 

1.8 (fixed) 
1.8 (fixed) 



1.95Z 
1.89"* 



-+0.05 
-0.05 
-,+0.05 
-0.06 

1.8 (fixed) 
1.8 (fixed) 



23+ - 01 
u - zo -0.03 



0.23 
0.19 



+0.03 
0.03 
+0.02 
0.02 



24+ 03 
u - z4 -0.02 

1Q+ 004 
u - iM -0.04 

-,+0.03 



0.20: 



0.02 



n 044+ ' 002 

n o7 +0.05 
u ' z '-0.05 
s+0.03 



0.28: 



0.04 



n n5 o + 0.003 

u.uoo_ 002 

26+ 04 
u - ZD -0.04 

0.02 6 



06+ 01 
u - UD -0.03 

n ocr+0.05 

u - c>o -0.05 

065+ 005 

U.UDO_ 005 



0.062 
0.24 
0.34 



+0.002 
0.002 
+0.06 
0.06 
+0.07 
0.06 



0.07 78 ± 17 
0.024 ± 0.002 

0.04 74 ± 16 

0.07 84 ± 18 
0.040 + 0.010 

0.05 72 ± 16 



0.036 + 0.005 
0.06 



0.048 ± 0.008 



0.045 ± 0.005 



63 + 14 



0.050 ± 0.030 

0.04 66 ± 15 



Table 2. The samples used for the joint fitting and the results from fitting the spectra with the simple power law model 'po' in which 
T is allowed to vary freely and the 'po+bb' model in which T is both fixed at 1.8 and free to vary, 'po+bb fix A' refers to the model in 
which the blackbody to power-law normalisation ratio is kept the same for each spectrum and 'po+bb fix B' is the model in which both 
the blackbody and power-law normalisations are free to vary. In the case of model A, the normalisation ratio quoted is the best-fitting 
value with a 68% error and for model B it is median value. The errors on the kT and T parameters are 90%. Notes: a Not all sources 
have both a MOS and a pn spectrum available. 'For this sample we are unable to estimate reliable errors from the fit. 



the normalisation ratio is > 0.01 is consistent with the in- 
trinsic percentage in the case of the top 2 bins, but is lower, 
although still within errors, for the lowest AEdd bin. Whilst 
the evidence is not strong, this may suggest that soft ex- 
cesses are smaller and less common in sources with lower 
accretion rates and we do not rule this out. 



4 ABSORPTION COMPONENTS 

Intrinsic cold absorption may also be present in type 1 AGN, 
suppressing the lower energy emission. The percentage of 
sources with detected absorption is shown in Fig. [2] and ap- 
pears to be limited by the spectral quality in a similar way to 
that of the soft excess. However, the effect is not as strong, 
with the detected percentage decreasing by approximately 
20% from the higher to lower count bins rather than ~ 80% 
in the case of the soft excess. It was suggested in Sll that the 
true percentage of absorbed sources could be as high as the 
~ 25% found in the highest count range, where we might ex- 
pect the spectra to be of good enough quality to detect any 
significant absorption if present. However, the highest count 
bins are heavily contaminated by target sources, resulting in 



a lower detected percentage of absorption components once 
they are removed from consideration. Fig. [6] shows the de- 
tected percentage after target removal. The highest count 
bin in this plot now suggests that 5.6 ± 3.9% of type 1 AGN 
may include an intrinsic absorption component. In the lower 
count bins (< 3200), the detected percentage of absorbed 
sources does not vary significantly suggesting that the de- 
tectability is not as heavily dependent on spectral quality as 
it is for the soft excess feature. 

The detectability of absorption in the spectra is highly 
dependent on the Nh value and the range of rest-frame col- 
umn densities found in the absorbed sources is very broad, 
10 21 to 10 23 cm . Therefore we cannot choose a single 
model to simulate which is representative of all the absorbed 
sources we detect, unlike in the case of the soft excess, and 
we do not have enough statistics in order to weight de- 
tectability curves by both A^h and z. We do attempt to 
quantify the detectability of absorption components with 
different column densities by simulating absorbed spectra at 
2 = 1. These detectability curves are shown in Fig. [6] by the 
black curves of different line style. They show that a column 
of 10 23 cm" 2 (equivalent to 2 x 10 22 cm~ 2 at z = 0) would be 
detected in most spectra with > 200 counts, whereas a col- 



Detectability of spectral components 7 



A Edd ranj 


;e a 


No. of sources 
(spectra) 


Total 
Counts 


Model 


*> 


1" 


kT 

(keV) 


Normalisation 
Ratio 


Percentage 
with SE 


-2.3 to - 


1.3 


28 (47) 


5613 


po+bb fix A 
po+bb fix B 


685/498 (1.376) 
570/452 (1.261) 


1.8 (fixed) 
1.8 (fixed) 


15+ 016 
u - io -o.io 

n io+0.04 


0.008 + 0.7 
0.005 


45 ± 12 


-1.3 to - 


1.0 


32 (52) 


8383 


po+bb fix A 
po+bb fix B 


821/660 (1.244) 
657/609 (1.079) 


1.8 (fixed) 
1.8 (fixed) 


25+ 03 
u - zo -0.02 

n 9 o+0.02 

u - zo -0.02 


0.048 ± 0.004 
0.04 


75 ± 16 


-1.0 to + 


0.2 


26 (36) 


6536 


po+bb fix A 
po+bb fix B 


546/500 (1.092) 
472/465 (1.015) 


1.8 (fixed) 
1.8 (fixed) 


n 1s +0.04 

16+ 004 
u - 1D -0.03 


0.05 + 0.01 
0.084 


83 + 21 



Table 3. The samples created based on Eddington ratio values and the results of the joint spectral fitting. a AEdd = l°g (-^bol/^Edd) 



umn of ~ 3x 10 21 cm" 2 (equivalent to 5x 10 20 cm~ 2 at z = 0) 
is not strong enough to be detected in spectra of this quality 
at z = 1. For the highest count bin (> 3200 counts), the de- 
tectability curves for all but the lowest level of JVh shown in 
Fig. [5] are at 100%. This means that we expect to be sensi- 
tive to all reasonable levels of JVh and therefore our intrinsic 
percentage estimate is robust. The fraction of sources with 
particular JVh levels is roughly constant in both different z 
and count bins (the 2 properties being correlated), and hence 
the slight decrease in the detected percentage between the 
top bin where we expect to detect all levels of A?h and the 
bottom bins is what is expected when objects with lower Ah 
and/or higher redshifts are no longer detectable. 



These simulations do not include a soft excess compo- 
nent which could also reduce the detectability of any ab- 
sorption present. We investigate this by simulating sources 
with both the standard soft excess parameters and 2 values 
of ]Vh (shown by the lines of open circles in Fig. [6]). For log 
JVh = 22 our sensitivity drops by approximately 10% at low 
count levels (~ 200), increasing to ~ 25% at higher count 
levels (~ 1000), making little difference to our conclusions. 
In the case of log Nn = 21.5, at low count levels (< 1000) we 
are mostly insensitive to the absorption anyway, such that 
the inclusion of a soft excess makes little difference to its 
detectability. At higher counts levels (~ 10,000) where the 
statistics are better, including the soft excess can reduce our 
sensitivity to the absorption component by ~ 65%. 




Absorption 

— log N„=23 (22.2) - 

— log N„=22.5 (21.7) - 

— log N„=22 (21.2) 
■■■■ log N H =21.5 (20.7) 



2=1 



10 2 10 3 10 4 10 5 

Counts 

Figure 6. This figure shows how the detected percentage of an 
intrinsic cold absorption component varies with the number of 
counts in the X-ray spectra (red, solid line). It reproduces Fig. 
[2] but the sources which were the target of an XMM-Newton ob- 
servation have been removed. As a result of the reduced number 
of sources this leaves, the original top three bins are combined. 
The percentage appears to remain constant at ~ 3% for spectra 
with 75-32,000 counts, suggesting the detectability is not as heav- 
ily dependent on the spectral quality as is the case for the soft 
excess. The black lines represent simulated detectability curves 
for the detection of absorption components with different column 
densities (shown by the different line styles) in simulated spectra 
at z = 1. The equivalent JVh value at z = is quoted in brackets 
on the figure. The open circles show the detectability of absorp- 
tion of log TVh = 22 and log TVh = 21.5, when a soft excess of 
typical shape and size is also included in the spectra. 



In £13 .21 a joint fitting was carried out on groups of low 
count spectra to see if the soft excess feature could be re- 
covered. Similarly, we fit the 'apo+bb' model to the same 
samples to see if an absorption component can be recovered 
in addition to the soft excess already found to be present. 
We implement the model with F fixed to 1.8, kT fixed at 0.2 
keV and leave both the power law and blackbody normal- 
isations free to vary. The best-fitting Ah values are listed 
in Table [4] We find that an absorption component can be 
recovered in the low count spectra, but only in the lower 
redshift bins is this component constrained. Although the 
Ah values are of the order of those seen in single object 
fits in the Sll sample, the range in column densities means 
that the values we obtain here merely represent an 'average' 
Ah value, the exact value of which should be treated with 
caution. 



z range AT H (xl0 22 cm 2 ) 



0.21- 


-0.55 


57+ - 27 
U -°'-0.15 


0.56- 


-0.77 


0.831,; 22 


0.77- 


-0.98 


< 0.43 


0.98- 


-1.12 


< 0.17 


1.12- 


-1.19 


< 0.53 


1.19- 


-1.28 


< 0.61 



Table 4. Results from fitting the 'apo+bb' model to the groups 
of low count spectra described in Table [2] In each case, V and kT 
are fixed at 1.8 and 0.2 keV respectively. 90% errors are quoted 
on the Nn value. 
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5 DISCUSSION 

The origin of the soft excess emission is still a matter of 
debate. Since this feature has been shown to be ubiquitous 
in the X-ray spectra of type 1 AGN, any models to explain 
its origin must be applicable to all sources. Some theories 
of the soft excess describe it as an 'apparent' feature, rather 
than an additional component in the spectrum. A strong 
jump in opacity at ~ 0.7 keV is created by lines and edges 
of ionised Ovn and Ovm which are smeared by the high ve- 
locities or gravitational redshifts found close to a black hole. 
This can appear from absor ption through optically t hin ma- 
terial in the line of sight (|Gierliriski fc Donell2004l ) or via 
reflect ion from optically thick material out of the line of 
sight dRoss fe Fabianll2005l ; ICrummv et ai1l2006r >. More re- 
cently, [DOTie_et_al. I (|2012r i have suggested that the soft excess 
may be intrinsic emission from the disc, which is shifted into 
soft X-ray energies due to the required colour-temperature 
correction, and further Compton upscattering produces the 
full components observed. However, this only applies to the 
lowest mass/highest accretion rate AGN and since we find 
the soft excess to be present not only in high accretion rate 
sources, this suggests two separate interpretations for the 
soft excess are required. An alternative theory, is that part 
of the soft X-ray emission may be due to cooling outflows 
which are now thoug ht to be reasonably common in AGN 
ijTombesi et al.ll2010r i. During Eddington accretion episodes, 
high velocity (v ~ 0.1c) and highly ionised (£ ~ 10 4 ) winds 
are produced and when these interact with the inters tellar 
medi um of the host galaxy, the gas is strongly shocked (|Kind 
2010). Subsequent Compton cooling of this gas may be ob- 
servable as a s oft X-ray component, as s uggested in the case 
of NGC 4051 (jPounds fc Vaughan|[2oTll 'l. 

The ubiquity of the soft excess means that any X-ray 
spectral fitting of type 1 AGN must take this feature into 
account. It has been shown that leaving this component un- 
modelled can lead to a V value ~ 0.1 too steep. In addition, 
any attempt to constrain an intrinsic Nh value must also in- 
clude the blackbody component in the fit since they appear 
in the spectra at a similar energy range. It was found in Sll 
that the average V values for sources fit with the 'po+bb' 
model were significantly flatter than those fit with the 'po' 
model (r po = 1.98 ± 0.01, r po+bb = 1.87 ± 0.05, KS signif- 
icance = 0.0003) suggesting that the underlying power-law 
slope in sources with a soft excess is different. However, our 
modelling uses a blackbody at low energies to model the soft 
excess component and this is purely phenomenological - it 
provides a good representation of the feature seen in spectra 
of our quality. If the soft excess is actually a broad spectral 
feature as suggested by reflection models, our modelling may 
not be fully accounting for the spectral complexity and the 
power-law slope at higher energies could still contain some 
of this component. 

This work and many in the literature, have confirmed 
the presence of a population of AGN which are classi- 
fied as type 1 due to the presence of broad emission lines 
in their UV/optical spectra, but also show significant X- 
ray absorption. Conversely there are also objects optically 
classified as type 2 whi c h are unabsorbed in X-rays (e.g. 
iPanessa fc Bassani|[200l . iMateos et al.ll2005al lbl'l. These ob- 
jects are not reconciled by the standard orientation based 
Unified Model which invokes an obscuring torus to both 



block the line of sight to the broad line region and give 
X-ray absorption in the case of type 2 AGN, but not type 
1. Constraining the fraction of absorbed type 1 sources may 
aid in interpreting these objects in terms of a correction to 
the Unified Model. Such a correction may inclu de invoking 
the 'clumpy torus' model (|Nenkova et a l. 2008a b) in which 
the torus consists of individual clouds. Observations of AGN 
which showed large amplitude and rapid variability of the 
X-ray column density were interpreted as occultations by 
clouds in the broad-line region, suggesting clumpy absorbers 
may be present at a range of different scales. In this model 
observing a given source as absorbed depends upon the cov- 
ering factor of the clouds a nd is a probability issue rather 
than one of just orientation IjRisaliti et al.ll2002l ). It has also 
been suggested that obscuration of AGN c ould occur due 
to the presence of a warped accretion disc jGreenhill et al.l 
120031 ; lNavakshinll2005l ; lLawrence fc Elvisll2010h . In this sce- 
nario there is no need for an obscuring torus; the type 2 
objects tend to be the ones with larger misalignments, giv- 
ing larger covering factors. 

Whilst it would be interesting to compare how the de- 
tection of the low energy spectral components varies with 
radio loudnesf], we lack the statistics to do this. However 
we do note that 6/75 radio loud quasars (RLQ) have a de- 
tected soft excess and a joint fit of 29 sources with a total of 
~ 7000 counts gives similar soft excess parameters to those 
found for the radio-quiet objects. The prevalence and mag- 
nitude of soft-excesses thus appears very similar for RQQ 
and RLQ, contrary to previous suggestions that RL Q do 
not include this component, (e.g. ISulentic et al"1l2010r ). 



6 CONCLUSIONS 

In this paper we have simulated X-ray spectra consisting 
of a power law and a soft excess, in which the shape of 
the later co mponent represent s a typical example of those 
found in the lScott et alj ()201 lh sample. The spectra were fit 
in XSPEC over the energy range 0.5-12.0 keV and an F-test 
was used at 99% significance to test whether the compo- 
nent was statistically required in the fit. By repeating this 
procedure for spectra with different numbers of counts, max- 
imum detection curves were generated and compared to the 
observed results from the real data sample presented in Sll. 
The effect of redshift on the detectability was also taken 
into account. Despite the raw percentage of sources with a 
soft excess being ~ 8%, we showed that after correcting for 
the spectral quality, the intrinsic percentage is 75 ± 23%. 
This suggests that within the Sll sample, almost all of the 
sources could include a soft excess component with a shape 
and size typical of that seen in the highest count spectra, 
and it is merely the quality of the spectra that is limiting 
our ability to detect them. The detectability of a soft excess 
component is dependent on both the blackbody temperature 
and the size of its normalisation with respect to the underly- 
ing power law used in the modelling, but we find that using 
slightly different values for either does not change our overall 
conclusion. 



4 The determination of the radio loudness of the sources is de- 
scribed in Sll. 
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If soft excesses are ubiquitous, then the feature should 
be recovered in a combination of low count spectra. Groups 
of ~ 50 spectra (~ 7000 counts) were created in narrow z 
bins, including spectra with < 500 counts and which had 
no previous evidence for additional spectral features. The 
groups at z < 1 were shown to be better fit with a model in- 
cluding a soft excess, and the temperature and normalisation 
with respect to the underlying power law of the components 
required were consistent with those found in individual high 
count spectra. 

We are unable to conduct a simulation procedure in 
order to determine the percentage of type 1 AGN which re- 
quire an intrinsic cold absorption component. However, we 
suggest that its detectability may not be as dependent upon 
spectral quality as the soft excess. We stress that a non- 
negligible percentage, ~ 5%, of type 1 AGN may include 
such an absorption feature and therefore any spectral mod- 
elling must take the possibility of this feature into account. 
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